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DETECTION AND RECOGNITION OF SIMPLE SPATIAL FORMS. 


Andrew B. Watson 
NASA Ames Research Center 
Moffett Field. CA 94035, USA 


1. Models in Vision 

In spite of the large number of intelligent, energetic people engaged in the study of vision, it often 
seems that we advance our understanding of this process at an excruciatingly slow pace. Why is this 
so? 1 believe that it is the fault of our models. We have been seduced by the simplicity of explana- 
tion available in the physical sciences, and try to describe the vast complexity of vision with models 
that would not do justice to a sphere rolling down an inclined plane. 

Today 1 will describe a model which in a small way attempts to remedy this situation, ihe 
principles that undcrly it arc simple, but it attempts to represent more adequately the inherent com- 
plexity of vision. Before describing this model in detail, 1 will emphasize its unusual aspects, and 
some of the benefits we may gain from this departure. 

First, the model is explicit. No "channels" "mechanisms" or other ill-defined entities appear. 
Put another way, the model is computable. It is well enough defined to permit numerical predic- 
tions to be made for a given experimental situation. Second, the model adequately represents all the 
dimensions that are pertinent to the cx|)erimcntal domain. In the case 1 consider, these arc the two 
spatial dimensions of a monochromatic, stationary image. Although this is an inherently two- 
dimensional (2D) situation, visual models in this context have almost invariably been one- 
dimensional { ID). Third, the model is general within its specified domain. The same model can be 
used to predict performance in a wide variety of different of visual tasks on a wide variety of images. 
This contrasts with most visual models, which only attempt to explain the data from a very res- 
tricted domain. Fourth, the model draws many of its parameters and assumptions directly from phy- 
siological results. While this has long been an ambition of psychophysical theories, it has rarely 
been realized. Finally, the model attributes intelligence to the observer. It is perhaps understand- 
able that this has not been done in the past, since psychophysicists arc most often their own 
observers. The model assumes that the observer will make optimal use of the available information. 
This assumption is of course a commonplace in signal detection theory [10], but rarely appears in 
explicitly visual models. 

'The price of these amendments is a model that is somewhat more complex, requiring more 
assumptions and parameters than average. The benefits, however, arc worth i; It provides a 
mechanism with which to integrate information within the field. Since the model can be applied in 
diverse contexts, it provides a common repository for results from many different sources. The 
model also insures the consistency of interpretation from one experiment to the next. Too often a 
model is constructed in one context that is quite obv iously incompatible with data from another con- 
text. The model also provides a natural path along whicli to specify in ever greater detail the rela- 
tion between our visual experience and the physiological mechanisms of the visual brain. T'inally, to 
the extent that the model is successful, it will permit us to turn our attention from the early, 
image-driven stages of vision to the more complex, cognitive processing that must subsequently 
occur. 


It should be clear that the argument I have advanced is on behalf of any model that meets the 
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criteria of explicitness, adequacy, and generality noted above, llie model I will describe is just a 
candidate, and a green one at that. 

2. Domain of the model 

What is the domain of my model? It is intended to account for psychophysical responses of an 
accommodated, fixating human observer viewing binocularly a 2D stationary achromatic image at a 
fixed adapting level of 340 edm"^ Specifically, I have excluded both color, stereo, and dynamic 
imagery since it seems likely that they are analyzed independently of spatial luminance contrast. 

3. General properties of the model 

Since this model is a first approximation whose parameters and structural details arc quite open to 
improvement, it is worth giving a brief overview of its essential structure. The heart of the model is 
a set of feature sensors, which perform different measurements upon the input image. The set of 
measurements taken from a particuhy image make up a feature vector. I'he sensors arc perturbed 
by noise, so that from presentation to presentation, the same image will give rise to somewhat 
different feature vectors. Following each trial of an experiment, the observer processes the feature 
vector in an optimal way to arrive at a psychophysical decision. 

The model thus divides naturally into two parts: generating the feature vector, and processing 
the feature vector. 

4. Generating the Feature Vector 

Kach feature sensor is defined by a spatial weighting function, which is a model or template of the 
feature to be sensed. The measurement is performed by cross-correlating the contrast image and 
the weighting function. The features used in the model arc typified by the pattern in Fig. 1( top). It 
is the product of a 2D sinusoid and a 2D Gaussian. Its various parameters, which may differ from 
sensor to sensor, are: spatial frequency (the frequency orthogonal to the bars), the orientation (the 
angle of the orthogonal to the burs), the width (the size at half height of the Gaussian defined 
orthogonal to the bars), the height (the comparable measure parallel to the bars), and the phase of 
the sinusoid (defined relative to the center of the Gaussian). Finally, each function is located at a 
piiriicular point in the visual held. Of these seven parameters, we shall sec that two can be defined 
in terms of the others, leaving five feature dimensions that may vary from sensor to sensor. 

'niis general form of weighting function closely resembles the receptive field profiles of sim- 
ple cells, the most numerous class of visual neurons in tlic striate cortex of cat and monkey [3,13]. 
Most cortical cells respond only over a modest region of space, a modest band of spatial frequencies, 
and a modest range of orientations [3.4, 15,24). Psychophysical data arc also consistent with a 
moderate selectivity in space, spatial frequency and orientation [1,29,32]. The Fourier transform of 
tlic feature is shown in Fig. 1( bottom), where it can be seen that the sensor also responds only over 
a small band of spatial frequencies and orientations. 

This sort of function is often named after GABOR, who showed that (in the ID case) it 
minimizes the width in both space and frequency [ 8 ]. DAUGMAN has noted the virtues of the 2D 
version of the Gabor function [2]. 

4.1. Width and Height 

ITic data of DKVAI.OIS et aL [3] indicate simple cortical cell receptive fields tend to be slightly 
taller than they arc wide, but including this subtlety did not seem worth the extra computational 
effort. Accordingly, 1 have equated height and width, so the supports of both pattern and transform 
arc circular, lliis also allows us to specify the width of a pattern, or of its transform, by a single 
number: the diameter at half height. With this amendment, the weighting function for the sensor 
can be written 

w'(jt.y) = co^[2nf{xcos0 + >'sind) + ip] ( 1 ) 

Where / is the spatial frequency, 0 is the orientation, tv is the width, and 9 is the phase. 

I'he width of a feature and the bandwidth of its transform are inversely related 
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Fig.t Top: an example of a spatial feature consisting of the product ot .1 MMll^old aiul .1 ( iaiissian. 
The two axes are hoii/.ontal and vertical visual field position. Widili of the !c,cjirc is I 124 cycles of 
the sinusoid. !''ic orientation is 22.5 degrees, and the phase is 0. huiiom the rnniiri iransfonn of 
the feature. Hie axes are liori/onlal and vertical spatial iVeiiuency. lo|) and hottum liguies are to 
scale when 1 degree on the top scale equals 1 cyclc/degree on the boitom sc; le 


(bandwidth = 4 lii2/( wwidth)). I have set the width of each sensor to alx'iii I <24 ( ycles of the 
sinusoid. This iiicmiis that sensor bandwidth is proportion.il to sensor sn.in.d ii.i|u^ia> (bandwidth 
= 2/3 frequency), or. in logarithmic terms, that c;ich sensin' has a b.mdwidlh o( one octave. It also 
means that a low frequency sensor will be large, and a high frequency sensoi will be siii.ill. 

Spatial frequency bandwidths for simple striate cells rise approximately in proportion to fre- 
quency. Handwidths appeiir to range from one half to 2.5 iKtaves, but one ot lave h;iiidwidths 
comprise a l.irge fmciion of those recorded (3,15|. Fsyc'iophysic.il d.it.i do not h.id so directly to 
estimates of bandwidth, but they arc consistent with a proportion. ihty beiwcen lieqiiency and 
bandwidth, .md with log bandwidths of about one octave 129,30,31). 

4.2. Freipieney 

Since each sensor covers only one octave of frequency, wc need a mimbei of sensoi s to cover the 
full range of lict|iiency which the human can sense (.ibout 0 to 60 cycKs/tUgis e) Hie number 
required will .ilso depend upon the density of the sensors in the fietinency il. in nn I his cannot be 
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less than about one sensor per bandwidth, or information in the image will be lust, and sensitivity 
will show dips between sensors [29], In the model I have tried to include as few sensors as possible, 
so just one sensor per bandwidth has been used. Iliis yields a set of eight basic sensors, whose fre- 
quencies range from 0.25 to 32 cyclcs/degree in octave steps. This basic set is shown in Fig.2. I'he 
largest sensor, a small part of which is barely visible in the upper left corner, is 128 times larger 
than the smallest. Ibe sensors have been arranged in a spiral so as to ht in a square, but should all 
be considered to lie at the center of the visual field. 'Phis basic set will be present on’y at this point; 
the sensors at other locations will be generated by a rule described below. 

'fhe transforms of this basic set arc shown in Fig.3 For clarity, only one of the two parts of 
each spectrum is shown. The positions of the spectra arc correctly placed relative to the correspond- 
ing features in Fig.2. Note that the smallest feature corresponds to the largest transform. For clar- 
ity 1 have shown features with different orientations. If all the features were similarly oriented, all 
the transfonns would lie at a single angle, and would overlap considerably. This illustrates that all 
of frequency space is covered. 



Flg.2 The basic set of sensors. 'I'hc sensor frequencies range from 0.25 to 32 cyclcs/dcgicc in oc 
tave steps. F.ach sensor has a bandwidth of one octave (width = 1.324 cycles). 
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l'ig.3 Fourier transforms of the set of basic sensors. Of the two Gaiissians synimciiically placed 
about the origin, only one is shown. Axes are horizontal and vertical spatial frequency. 


4.3. Spatial Sampling 

To preserve the information in the image the ID spatial density of the sensors of a particular fre- 
quency must be at least twice the frequency extent of the sensor. This extent is not well defined for 
a Gaussian spectrum, but if we used the bandwidth, the required sample frequency would he 4/3 the 
sensor frequency. I have used the more conservative factor of 1.8. Tluis for the hasic set of eight 
frequencies at the fovea, sampling intervals range from 0.0174 degrees for the highest frequency to 
2.22 degrees for the lowest. 

4.4. Orientation 

The orientation bandwidth of a sensor (as measured with the spatial frcqiu'ncy to which the sensor 
is tuned) is completely determined by the spatial frequency bandwidth, lor our one (Ktavc spatial 
frequency bandwidth, the orientation bandwidth is about 38 degrees. I'pis can he seen in Fig.3, 
where the orientation bandwidth will be given by the range of angles that mtcrscci the spectrum at 
half-height or above. This 38 degree figure agrees well with the 40 degrees icfx)itcd hy DFVAI.OIS 
el aL [4] as the median orientation bandwidth for simple cells in monkey siiiaie cortex. There is a 
great deal of psychophysical data related to orientation tuning, but as yet there has been no convinc- 
ing route from the data to an estimate of the orientation bandwidths of underlying sensors. 
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In spoi irvini’ sensor density in the oricntiition domnin I have again followed the nde of about 
one sensor pei handwulth, this yields the set of fi\e orientations illostialed in I ig.4. The 
transforms of lesc leatures arc shown in l'ig.5. Successive transforms are overlaid, and each is 
truncated at about one width, illustrating the degree of sensor overlap in the domain of orientation. 

4.5. Phase 

I include sensois of two phases, 0 and 90 degrees. At least two phases are required to preserve the 
odd and even intormation in the image. Beyond this, POl.l l-.N and KONNI'K 119] have l\)und that 
adjacent simple eells in cat striate cortex usually share a common spatial lieqiiency and i)i ientation, 
but dill'ci in phase by 90 degrees. I'xisling psychophysical data do not lead directly to any strong 
hypothesis leg.iuling sensor phase. 

4.6. .Spatial \nisoplaiiatism 

Perhaps the giealcst ililliculty in .’h modeling of human sp.itial vision arc the large ami complex 
variations in siiaiial processing that occur with distance from the center of the visual field. In gen- 
eral. contiasi seiiMiivity declines with eccentricity, and does so more rapidly for high .pati.il frequen- 
cies than foi low (I I., ’0.271, However it has recently been suggested that s[)atial processing in all 
portions of the \kimI Meld is identical s.ivc Mm a ch.inge in the ?D spatial sc.ile 15, 1 1,22). This 



l-iU^ fhe live sensor orientations. I hey range from 0 to 144 degrees in steps of ^6 degrees. 
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Fijj.5 rniiisfonns of the five sensor oricntiitions. 1-^icli G.iiissi.ui h;is hcon tiiiiu.iU il .n mu' wklth for 
clarity, flic oiicru.ition b.indwidth of each sensor is alioiit dcgiocs. 


would be the case if ganglion cells changed in size with eccentricity, but all miIimhiuuiI iiioccssing 
were homogeneous with respect to eccentricity. 1 hough the cnideiice tor ibe. Ii\ puiliesis is not 
overwhelming, it i« too great a simplification to be resisted. Accordinglv I li.oe i.iken the spatial 
scale s to be given by the scaling function 

s = \ f kc (2) 

where ^ is a constant and e is eccentricity in degrees, l or iiiy own eyes, A is .ibuiii ti 4 I bis agrees 
reasonably well with the estimates of other authors, and with the ch.mges in g.inglion lell density 
and cortical magnification factor with eccentricity 15,2 1], It should be cmpliasi/ed ili.it this function 
is intended only as a first approximation. 

This scaling function has been used to scale the si/c, frequenev, and sp.mil density of the 
sensors at c.ich eccentricity. Iluis if a sensor at the fovea h.is a basic fieqiieiK ^ ol r width iv. and 
11) density </, the related sensor at eccentricity c will have frequency / /s, width \w. .nul density 
d/s. These relationships arc shown in l-'ig.t). where I have drawn a s.im|thng .ni.iv loi ,i basic fre- 
quency of 1 cyelc/dcgrce. Ivtich circle in the figure is about 2/5 the width ol the i nnesponding sen- 
sor. showing the considerable degree of spatial overlap among sensors o( a comnion liequcncy. 
Note that as eccentricity increases, sensor frcqtiency and sp.itial density aie scaled by the same 
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l'iK.6 The sampling array for a basic frequemey of 1 cycic/dcgrcc. 1110 two axes show degrees from 
the fovea, r.ich circle is about 2/5 the width of the corresponding sensor. 


amount, so that the sampling requirements continue to be met. The set of all sensors generated in 
this way from a given basic frequency may be considered a family. I'his is the closest we get to a 
"spatial frequency channel" of traditional visu.il theory. 

Ilic actual positions of individual sensors h.ive been determined by finding, for each family at 
each ecceniiiiity. lh.it regiil.ir polygon centered upon the fovea whose sides are about equal in 
length to the dilference between the present and next lesser eccentricity. Ibis algorithm is used 
only as a simple w.iy of generating sensor locations with the appropriate properties. 

4.7. .Sensor Gain 

It is well known that contrast sensitivity depends upon spatial frequency. How do we incorporate 
this dependence into the model? I begin by dclining sensor ^ain as the response to a matched target 
at unit contras’. If sensor gain at the fovea is /). where / is the basic frequency, then spatial 
processing will be homogeneous irrespective of eccentricity only if sensor gain is adjusted to take 
into account the larger si/e of eccentric sensors. This can be accomplished by multiplying each 
weighting function by /) 16 /;i2/( w), where w is the actual width and / is the basic frequency 

of tlie family to which the sensor belongs. This scaling of gain would result naturally if each sensor 
received contributions from an cqtial number of receptors, and this would be likely if the receptor 


density followed the same scaling fiinclion given by (2). 
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Ihc actual functional form lor v(/) is not simple to estimate, since it is not gi\en directly 
by any simple empirical measurement. A first approximation, indirectly csliiii.iteil (ioiii contras' 
sensiiivily to one octave bandwidth grating patches, is shown in Fig.?. 

4.8. romputing the Feature Vector 

Ihc preceding assumptions permit us to compute a feature vector, c.ich entry of which is the 
response of a single feature sensor. The many cross-correlations invoKed are gei.ei.illy more easily 
computed by way of well-known detours through the frequency domain. 10111101 s.ivings c.m he 
gained by disregarding sensors that arc well outside the space or frequency s..ppoit of the im.ige. 
l-iven so, the amount of computation required to obtain the feature vector may lie lormid.iiile, es[ie- 
cially for large stimuli with high frequency content. Idrtunalely the sensors .iiul .iH the images con- 
sidered here have I'ouricr transforms that can be obtained analvlically, so iltat [iiiite ti.mshirms nee 1 
not be resorted to. 

5. I’rocessing (he Featiiic Vector 

The psychological literature is filled with suggestitms as to how an I'hservci might make use of the 
feature vector to detect and disci iminaic among patterns. One might look foi ihe l.irgest single 
entry, look for the centroid of some distribution of vector entries, clieck whether at least one entry 
is larger than a criterion value, and so on. Hut these procedures are largely </</ /mr. aiul rarely gen- 
crali/c much beyond the experimental context they arc designed to explain, lor ex.ini|)le. looking 
for the largest single ciury might work at threshold but is not a reasonable rule above threshold. 
Also, these procedures arc nut derived from basic principles but rather appeal to the intuition, 
sometimes not very forcefully. 

In contrast, the statistical theory of pattern classiiication provides .1 logic. il basis for 



0.25 


32 


I'lg.l Sensor gain, x< /)■ 


C/deg 




OHiCiN^L ^ 

OF POOR QUALrTY 


consiruclmg optimal procedures with which to analyze the feature vector |fi, 14J. !'urthcrmorc piit- 
tern classification theory is general, and could in principle be applied to most tasks the human 
observer confronts in the psychophysical lab, if not in the world at large. 

5.1. 'flic Optimal Bayesian Classifier 

I Cl me illustrate how I apply this fheo’.y to the feature vector constructed a moment <”,o. I'irsl I 
assume that each sensor is |x?rtuibcd by zero mean, unit variance Gaussian i’.oise. I he feature vec- 
tor then has a multivariate normal density. I further assume that the covariance matrix of this den- 
sity is the identity matrix, which means that the sensors arc statistically independent. 

The job of the classifier is to examine the feature vector and decide which of several jxissiblc 
ima.gfs was m fact presented, l-ach of the possible images is associated willi a mean vrclor m*. 
rrcsentaiion of one of the images gives rise to a feature vector /.. If we also know the prior proba- 
bilitv of each .iltcvnalisc, Hayes’ rule allows us to use the feature vector to calculiite the posterior 
piobability of c.ich ol tlie altcmalives. In the cases I will consider, all alternatives have ec|ual priors. 
A reasonable, and m fact optimal rule is to choose ih image with highest posterior pioii.ibiliiy. 
SiiH e we c.ire only which has higliest prob.ibility, it is suflicicnt to calculate for each alternative any 
i|iiantity that is monotonie with posterior probability. Such functions on the feature vector arc called 
dnctimmanl.s. 

In the c.ise I have devclop-cd so far. a set of optimal discriminants irc 
i.\{/) (/ 111*)' (z III*) (3) 

I I 

where the suyK'iscript / denotes t!ie transpose of a matrix. Note that if / has n elements, then it 
c.in be considered a point in n dimcnsion.il space. I .kcwisc for each of the mean .ectois in*. I his 
lorm of discriminant tells us to pick the .'.Iternativc whose mean vector is closest to / in this space, 
lor this reason this is often called a minimum (/fv/m/tv classilicr. 

5.2. I’rior liiformation 

One objection to this scheme is that it assumes that the obscr. has perfect prior information 
regarding each mean vector in*. Since this sector is obtained through experience, and c\ieh experi- 
ence IS subject to saiiabihty. this assumption is unrealistic, l-mpirical data .ilso clearly show that the 
observer is less ili.iii optim.il. .is thougli he were uncertain regarding ni* |7,lf»,25). l•ortun.ltely, for 
the detection task I will consider, an .ipproximation is avail.ibic for a more realistic uncertain 
oliserver IlHJ. In the case oi discrimination, we must be content for the moment to examine the 
beh.iv ior of the ideal. Ibis is unfortunate, since to the extent that the sensors prese'^ve the informa- 
tion in the image, d'scrimination pertorm.mce tells us more about the orthogonality of the alterna- 
tive im.iges ib.in ibout the properties ol the sensors. 

5. t. Detection 

I he .ibility of ,m observ ei to dc;ect a p.ittein is often measured by a two interval forced choice 
(2II ( ) nn ihod. le. whn h the observer must judge which of two time intervals cont.iined a signal. 
Mils in.iy l>e viewed as .i disciimin.ition between two images; the test image and a iiull image. I rom 
a iiumber of tii.ils we can deteimine the proportion correct, or </'. defined here as v^2 times the nor- 
mal devi.ite of the proportion correct. Since there are only two alternatives (image in first interval 
or in second interval), we can diflerencc the two discriminants and use the sign of the result to 
choose an alternative. This quantity is normally distributed with a mean equal to the squ.ued length 
of the mean vector, and a variance of twice the squared length, l-rom this it is easy to show that for 
any given pattein, 

it' = villi' in (4) 



As noted above, this prediction assumes an observer with perfect prior information. Tor an 
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linccitain observer, performance can he approximated by specifying that for all stimuli at some fixed 
</'. 



where ^ is a constant of about 3.5 (18). This approximation has been use* in the following predic- 
tions of detection performance. 


The model gives a good account of sensitivity to a wide variety of difTerent spatial targets, 
l-or example, the model does a rcr<sonable job of predicting the contrast sensitivity runction at vari- 
ous CLcentiicities, and the effects of size of a grating p.ittern upon sensitivity. Among the earliest 
experiments to suggest that visual mechanisms were selective lor ipatiai frequency "-ere those which 
measuied sensitivity to mixtures of two frequencies |9.2.f). Data from a more recent version of this 
experiment arc shown in Fig.S (291. Hrreh point shows the sensitivity to a niixturc of two frequen- 
cies. relative to the sensitivity to either frequency alone. If 'be two ficquencies .idd line.irly. we get 
a ratio of two. if they do not ad.' at all. we get a ratio of I. 'i'lic data show a decline in summation 
as the two frequencies move faahcr apart. The prediction of the model is shown by the upper 
curve. It docs not agree precisely with the data, but given that most sources of eir'*r m ihc data will 
tend to lower the ratio it docs not do a bad job. I have also shown the prediction in tire ease of 0.8 
iKlave bandwidth sensors. I'hc fit is better, and this amendment may have to be resorted to in the 
future. 


5.4. Discrimination 

A particularly sensitive way of measuring discrimination performance is by a iwo-by-two forc'd 
choice (2X2I-C) method (I7.26.3I|. As in the 2H'C method, on each trial the observer is pi'’sented 
with an image in just one of two time intervals. Hut in this ease the image is selecteil at random 
fiom a set of two. The observer must choose both the interval containing the image, and wluch 



Frequency Ratio 


I'ig.S Summation between two different spatial frequencies. I’redictions of the model with 1.0 iK- 
i.ive and 0.8 octave bandwidths arc shown. Data are from (29). 
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image it was. The advantage of this technique is that it measures dctcctbn and discrimination con- 
currently, so the two sorts of performance can be compared. 

In the ease of 2X2FC llic observer must choose among fou'’ alternatives, so we must dctcr- 
m,(.c four discriminants. If we call the feature vector from the fir. t interval and that from the 
second interval 12 , and the mean feature vectors for the two images a and b, then these discrim- 
inants are 

du = a dia = tj a (5) 

^16 = 

where dib for example is the discriminant for image b in the second interval. Here we cannot 
reduce these to a single function of z as was possible in the ease of 2IFC. Furthermore, although 
each discriminant is again normally distributed, the four arc not independent, so we cannot easily 
derive the probability with which any one is the largest and must resort to Monte-Carlo tccliniqucs. 

5.4.1. Spatial Frequency 

Since our model observer discriminates among images on the basis of certain features, it is interest- 
ing to ask how well the human observer discriminates betwee,. two images that arc members of the 
feature set. Fig.9 shows some data collected with a 2X2FC method in which the two images to be 
discriminated were Gaussian-windowed sinusoids, like the features of the model (311. The ordinate 
plois the ratio of d' for discrimination and detection, which is a measure of discrimination perfor- 
mance relative to detection. This method of plotting 2X2FC data is due to THOMAS, wlio has 
done much of the pioneering work in this area. The ratio rises rapidly as the two frequencies are 
moved farther apart, so that when they differ by about an octave, we discriminate between them as 
well as we detect either one. ITic model’s performance, shown by the solid line, is better than the 
human observer. Introduction of uncertainty into the model would reduce this discrepancy. Notice 


gratings bars 



Frequency or Width Ratio 

Fig.9 Discrimination bct'‘'ccn grating patches of different frequencies (circles) and Gaussian bars of 
different widths (squares Jurves arc predictions of the model. Data arc from (31) and (28). 
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that, as a rule uf thumb, the model predicts a ratio of one when the two patterns diffci by about one 
bandwidth. 

5.4.2. Bar Width 

What if we repeat the experiment with images which do not resemble the sensor features? The 
square symbols in l'ig.9 are comparable data for discriminations between Gaussian bars: patterns 
with a Gaussian profile in both horizontal and vertical dimensions and with height always at least 
twice the width (28). Discrimination performance is vastly poorer, so that a difference of about 4 
octaves is required before discrimination is as good as detection. This poorer performance is very 
nicely described by the model, whose predictions arc shown by the rightmost line. 

5.4.3. Orientation 

1310 last set of data I will show you were collected by THOMAS and Gil l.l' (26]. They used a 
2X2FC method to discriminate between gratings of different orientations. The gratings were large, 
extending 15 cycles in each direction. Their data arc shown in Fig. 10. Discrimination improves 
rapidly as the difference in orientation increases, and is almost as gooc as detection when the 
difference is 10 degree- . The model’s simulated data arc shown by the eftmost solid line. The 
agreement is quite good; perhaps too good considering uncertainty has not I con included. 

On the basis of a ID model, THOMAS and GII.I.H estimated mechanism orientation 
bandwidths of 10.5 to 20.5 degrees, depending on observer, nicsc are 2 to 4 times narrower than 
the sensor bandwidths used in the prediction in Fig. 10. These authors acknowledged that their esti- 
mates were much narrower than bandwidths of cortical cells, and speculated that "psychophysically 
defined channels represent physiological mechanisms other than single cortical cells". 

How can the present model with 38 degree bandwidths perform as well as the THOMAS and 


Large grating 



Fig. 10 Discrimination between large gratings of different orientations (circles). Data are from [261. 
Curves show the predictions of the model for large and small gratings. 
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Gll.LE model with bandwidths about 3 times narrower? ITie most likely answer is that the present 
model is 2D, and can make intelligent use of the pattern of aetivity across a large number of spa- 
tially distributed sensors. When the stimulus is confined to a small area, performance deteriorates 
markedly, as shown by the rightmost curve in Fig. 10. (though much of this is due to the broadened 
orientation bandwidth of the stimulus itselO. 

These observations illustrate an important point: sensor properties cannot be estimated 
without an explicit, computable, 2D model of the sensors and of their distribution over the visual 
field. The model must also allow the observer to make intelligent use of the sensor outputs. This 
argument applies to any effort to derive sensor par^neters from psychophysical data, that is, to 
almost all psychophysical research. 
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